Causality Networks

نویسنده

  • Ishanu Chattopadhyay
چکیده

While correlation measures are used to discern statistical relationships between observed variables in almost all branches of datadriven scientific inquiry, what we are really interested in is the existence of causal dependence. Statistical tests for causality, it turns out, are significantly harder to construct; the difficulty stemming from both philosophical hurdles in making precise the notion of causality, and the practical issue of obtaining an operational procedure from a philosophically sound definition. In particular, designing an efficient causality test, that may be carried out in the absence of restrictive pre-suppositions on the underlying dynamical structure of the data at hand, is non-trivial. Nevertheless, ability to computationally infer statistical prima facie evidence of causal dependence may yield a far more discriminative tool for data analysis compared to the calculation of simple correlations. In the present work, we present a new non-parametric test of Granger causality for quantized or symbolic data streams generated by ergodic stationary sources. In contrast to state-of-art binary tests, our approach makes precise and computes the degree of causal dependence between data streams, without making any restrictive assumptions, linearity or otherwise. Additionally, without any a priori imposition of specific dynamical structure, we infer explicit generative models of causal crossdependence, which may be then used for prediction. These explicit models are represented as generalized probabilistic automata, referred to crossed automata, and are shown to be sufficient to capture a fairly general class of causal dependence. The proposed algorithms are computationally efficient in the PAC sense; i.e., we find good models of cross-dependence with high probability, with polynomial run-times and sample complexities. The theoretical results are applied to weekly search-frequency data from Google Trends API for a chosen set of socially “charged” keywords. The causality network inferred from this dataset reveals, quite expectedly, the causal importance of certain keywords. It is also illustrated that correlation analysis fails to gather such insight.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

CMDTS: The Causality-based Medical Diagnosis and Treatment System

Our medical world is replete with clinical data but this data is rarely automatically exploited for bringing more health to our society. Many researches have been conducted in Medical Data Mining, but almost all of them have focused on diagnosing the diseases not treating the patients. In this paper we propose the Causality-based Medical Diagnosis and Treatment System, which can be used to diag...

متن کامل

CMDTS: The Causality-based Medical Diagnosis and Treatment System

Our medical world is replete with clinical data but this data is rarely automatically exploited for bringing more health to our society. Many researches have been conducted in Medical Data Mining, but almost all of them have focused on diagnosing the diseases not treating the patients. In this paper we propose the Causality-based Medical Diagnosis and Treatment System, which can be used to diag...

متن کامل

A “Chicken & Egg” Network Coding Problem

We consider the multi-source network coding problem in cyclic networks. This problem involves several difficulties not found in acyclic networks, due to additional causality requirements. This paper highlights the difficulty of these causality conditions by analyzing two example cyclic networks which are structurally similar. Both networks have an essentially identical network code which appear...

متن کامل

Mutual Causation in Highway Construction and Economic Development

This paper investigates the relationship between the growth of road networks and regional development. We test for mutual causality between the growth of road networks (which are divided functionally into local roads and highways) and changes in county-level population and employment. We employ a panel data set containing observations of road mileage by type for all Minnesota counties over the ...

متن کامل

Stock Market Interactions between the BRICS and the United States: Evidence from Asymmetric Granger Causality Tests in the Frequency Domain

The interaction of BRICS stock markets with the United States is studied using an asymmetric Granger causality test based on the frequency domain. This type of analysis allows for both positive and negative shocks over different horizons. There is a clear bivariate causality that runs both ways between the United States stock market and the respective BRICS markets. In addition, both negative a...

متن کامل

Exploring the Trade Openness, Energy Consumption and Economic Growth Relationship in Iran by Bayer and Hanck Combined Cointegration and Causality Analysis

This paper aims to investigate the direction of causality between economic growth, energy consumption and trade openness in case of Iran for the period 1967–2012. We apply the newly developed combined cointegration test proposed by Bayer and Hanck (2013). Vector Error Correction Model (VECM) is applied to determine the direction of causality between these three variables. The result of Bayer-Ha...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1406.6651  شماره 

صفحات  -

تاریخ انتشار 2014